TASSER_low-zsc: an approach to improve structure prediction using low z-score-ranked templates.

Authors

  • Shashi B Pandit
  • Jeffrey Skolnick
Abstract

In a variety of threading methods, poorly ranked (low z-score) templates often have good alignments. This work describes TASSER_low-zsc, a new method that identifies these low z-score-ranked templates to improve protein structure prediction accuracy. The approach clusters threading templates by affinity propagation on the basis of structural similarity (thread_cluster), then applies TASSER modeling, with final models selected by a TASSER_QA variant. To establish the generality of the approach, templates provided by two threading methods, SP(3) and SPARKS(2), are examined. The SP(3) and SPARKS(2) benchmark datasets consist of 351 and 357 medium/hard proteins, respectively (those with moderate to poor quality templates and/or alignments), each of length ≤250 residues. For SP(3) medium and hard targets, using thread_cluster, the TM-score of the best template improves by approximately 4% and 9%, respectively, over the original set (without low z-score templates); after TASSER modeling/refinement and ranking, the best model improves by approximately 7% and 9%, respectively, over the best model generated with the original template set. Moreover, TASSER_low-zsc generates 22% more foldable medium targets and 43% more foldable hard targets. Similar improvements are observed with low-ranked templates from SPARKS(2). The template clustering approach could be applied to other modeling methods that use multiple templates to improve structure prediction.
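The clustering step named in the abstract, affinity propagation on structural similarity, can be illustrated with a minimal sketch. This is not the authors' thread_cluster code: it is a plain implementation of the standard affinity propagation message-passing updates (Frey and Dueck), and the six-template similarity matrix, the `preference` value, the damping factor, and the function name `affinity_propagation` are all assumptions made up for illustration.

```python
def affinity_propagation(sim, preference, damping=0.5, iterations=200):
    """Cluster items from a square similarity matrix (list of lists).

    Returns a list giving, for each item, the index of its exemplar.
    Generic sketch of affinity propagation; not the thread_cluster code.
    """
    n = len(sim)
    # The diagonal holds the preference: how readily an item becomes an exemplar.
    s = [[preference if i == k else sim[i][k] for k in range(n)] for i in range(n)]
    r = [[0.0] * n for _ in range(n)]  # responsibilities r(i, k)
    a = [[0.0] * n for _ in range(n)]  # availabilities a(i, k)

    for _ in range(iterations):
        # Responsibility: how well k suits i compared with i's best alternative.
        for i in range(n):
            scores = [a[i][k] + s[i][k] for k in range(n)]
            for k in range(n):
                best_other = max(scores[j] for j in range(n) if j != k)
                r[i][k] = damping * r[i][k] + (1 - damping) * (s[i][k] - best_other)
        # Availability: how much support k has gathered from the other items.
        for k in range(n):
            pos = [max(0.0, r[i][k]) for i in range(n)]
            total = sum(pos) - pos[k]  # positive support from items other than k
            for i in range(n):
                if i == k:
                    new = total
                else:
                    new = min(0.0, r[k][k] + total - pos[i])
                a[i][k] = damping * a[i][k] + (1 - damping) * new

    # An item is an exemplar when its self-responsibility plus self-availability is positive.
    exemplars = [k for k in range(n) if r[k][k] + a[k][k] > 0]
    if not exemplars:  # fall back to the single strongest candidate
        exemplars = [max(range(n), key=lambda k: r[k][k] + a[k][k])]
    return [i if i in exemplars else max(exemplars, key=lambda k: s[i][k])
            for i in range(n)]


# Hypothetical pairwise structural similarities for six templates (invented values):
# templates 0-2 share one fold, templates 3-5 share another.
tm = [
    [1.0, 0.9, 0.6, 0.2, 0.2, 0.2],
    [0.9, 1.0, 0.9, 0.2, 0.2, 0.2],
    [0.6, 0.9, 1.0, 0.2, 0.2, 0.2],
    [0.2, 0.2, 0.2, 1.0, 0.9, 0.6],
    [0.2, 0.2, 0.2, 0.9, 1.0, 0.9],
    [0.2, 0.2, 0.2, 0.6, 0.9, 1.0],
]
# Assumed preference: the median off-diagonal similarity (0.2 for this matrix).
labels = affinity_propagation(tm, preference=0.2)
print(labels)
```

In this toy setting a low-ranked template would be grouped with whichever exemplar it resembles structurally, regardless of its z-score. Lowering `preference` yields fewer, larger clusters; raising it yields more.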

Journal:
  • Proteins

Volume 78, Issue 13

Pages: -

Publication date: 2010